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Abstract 



< 

We consider the problem of staffing large-scale service systems with multiple customer classes and multiple 

^^ dedicated server pools under joint quality-of-service (QoS) constraints. We first analyze the case in which 

Ph arrival rates are deterministic and the QoS metric is the probability a customer is queued, given by the 

'~i Erlang-C formula. We use the Janssen-Van Leeuwaarden-Zwart bounds to obtain asymptotically optimal 

^ solutions to this problem. The second model considered is one in which the arrival rates are not completely 

1—^ known in advance (before the server staffing levels are chosen), but rather are known via a probability 

^— H distribution. In this case, we provide asymptotically optimal solutions to the resulting stochastic integer 

I program, leveraging results obtained for the deterministic arrivals case. 

o 

\C 1 Introduction 

^' . . 

^D In this paper, we consider the problem of staffing large-scale service systems with multiple customer 

^-H classes and dedicated server pools for each class. Staffing is such systems must typically be done under 

^ quality-of-service (QoS) constraints and in our model the QoS metric is the probability that a customer 

j^ is queued (i.e., he must wait for service). In the classical M/M/n model, this probability is given by 

5—1 

^ the Erlang-C formula. Since we analyze multiple server systems, we formulate joint QoS constraints 

that connect the performance of all the server pools. Our first results pertain to such systems when the 

arrival rates are deterministic. However, our final goal is to examine systems in which the arrival rates 

are known to the system manager only through the joint distribution of the arrival rate vector. In this 

case, the joint QoS constraints become more interesting, since optimizing the staffing level under such 

constraints can rely crucially on the correlations between arrival rates for different classes of customers. 

The first primary contribution of the paper is to show that the Janssen-Van Leeuwaarden-Zwart bounds 

[TU] converge uniformly, under Halfin-Whitt-type scaling, to the Erlang-C formula (see Theorem Isl). The 

other main contribution is to introduce and solve problems in which the correlation between uncertain 

arrival rates plays an important role in staffing problems with QoS metrics. 

1 



The literature on staffing service systems goes bacfc to Erlang Irimself, in the development of the 
Erlang-C formula. More recent work has focused on developing approximations of various kinds and even 
this literature is quite voluminous. In terms of asymptotic approximations with deterministic system 
parameters, the fundamental inspiration for our work is Halfin and Whitt 7J which introduced the idea 
of a many-server limit. More closely related recent work is that of Borst et al. |4|. They consider many- 
server asymptotic approximations and use the Halfin- Whitt formula to approximately solve both the 



constrained and "dualized" formulations of their model (compare to equations (11) and (12) in Section 



2.2). A primary difference in this paper is that we use uniform convergence results to prove a stronger 
form of asymptotic convergence. In particular, they prove that the ratio of the approximate and exact 
costs converge to unity whereas we demonstrate that the difference of the costs go to zero. Furthermore, 
we also consider the multi-station case. The work in Janssen et al. [10. is also closely related to our 
results and in fact we use the bounds developed there to formulate our approximations. They consider 
the same single-station staffing problems as described here, obtaining stronger asymptotic optimality 
results than [4 via an approximation that refines the Halfin- Whitt formula. However, they also focus on 
the single-station case. 

Until recently, much of the literature on service system staffing focused on the models in which the 
systems parameters are known with certainty. However, with renewed interest in call center modeling, 
there have been significant efforts to incorporate parameter uncertainty, especially with respect to ar- 
rival rates. Harrison and Zeevi [8] were probably the first to explicitly consider multi-station staffing 
models with joint arrival rate uncertainty. They propose a staffing method based on a stochastic fluid 
approximation, but do not provide analytical results regarding the accuracy of this approximation. A 
follow-up paper, Bassamboo et al. [1] provides the rigorous justiflcation for the methods proposed in [8]. 
Bassamboo and Zeevi [3] present a data-driven version of the model considered in the two aforementioned 
papers. Bassamboo et al. 2] continue this stream of papers by performing a more nuanced investigation 
of news-vendor type solutions derived in [Sj , in the single-station case. 

Whitt [13] uses a different fluid model formulation to analyze a single-station system with both arrival 
rate uncertainty and staffing uncertainty due to absenteeism. Gurvich et al. i6^ investigate staffing a call 
center with multiple classes, QoS constraints, and uncertain arrival rates. However, the QoS requirement 
in their model are formulated via chance-constraints which insure that the requirements are met with high 
probability. Their paper contains a nice discussion of chance-constrained versus average QoS performance 
formulations. Finally, Kocaga et al. |llj introduce a single-station staffing and admissions control model 
with uncertain arrival rates. Their solution is to use a diffusion approximation to derive a square-root 
staffing rule in conjunction with a threshold-based admissions policy. 

The rest of the paper is structured as follows. In Section [2] we introduce the model and review the 
relevant mathematical background. The analysis of deterministic arrival rate problems is given in Section 
[3J In Section |4J we consider the random arrival rate case. 

In the sequel, we use the notation N = {1,2,...} and Z+ = {0,1,2,...}. As usual, (/){■) is the 



probability density function (PDF) of a standard normal distribution and $(•) is the corresponding 
cumulative distribution function (CDF). 

2 Model Formulations and Mathematical Background 

The basis for all of our models is the classic Erlang-C model, along with related approximations. For 
completeness we first review these standard results. This review also contains a key new result which 
shows that the bounds given in Janssen et al. [TO] actually converge uniformly in the QED regime to the 
Erlang-C formula. Numerical evidence indicates that the same type of result holds for the Halfin-Whitt 
approximation. However, proving this convergence seems more difficult and the upper bound given in 
[TO] is more useful, since it can be used to guarantee feasibility in problems with QoS constraints. 

2.1 The M/M/n Queue 

We consider a standard M/M/n queue operating under the first-come-first-served (FCFS) service disci- 
pline. The arrival rate is denoted by A and the service rate by ^. Without loss of generality, we assume 
/i = 1. Of course in this case the traffic intensity is also equal to A. Let Q be the stationary system size. 
A classical result is that Q has the following distribution: 



= k] = 
where 77 is a normalizing constant: 
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for /c = 0,1,2,. 
fc = n, n + 1, . . 
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V(A)^ (A)" 

^ fc! n\{l~X/n) 

The result above allows us to compute V{Q > n}, which by PASTA is equal to the stationary 
probability that a customer waits to receive service. This leads immediately to the Erlang-C formula, 
which we express as a function of the number of servers and the arrival rate: 

(A)" 



a{n, A) 



V- 



'nl{l~X/n)' 

In order to facilitate the analysis in later sections, we also use the Jagers-Van Doom [9] continuous 

extension of the Erlang-C formula: 

-1 



a{n, A) := 



A / te 
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{l + t)"-^dt 



(1) 



where n now is any non-negative real number. Hence, a : R,_|_ — >■ [0, 1]. 

In later sections, we use asymptotic analysis of the M/M/n queue in which the arrival rate and the 
number of servers both grow large. When these quantities grow large together in a specific manner, this 
is referred to as the Halfin-Whitt regime, due to the results below. 

Halfin and Whitt 7^ consider a sequence of M/M/n queues, indexed by the number of servers, n. 
The arrival rate in system n is denoted by A„. As n increases the scaling of A„ is such that the traffic 
intensity p„ :— -r- approaches 1 (recall /x = 1). For completeness, we now restate their classic result. 



Theorem 1. (Halfin and Whitt [7]) Consider a sequence of M/M/n queues with arrival rates A„, n 
1,2,.... As n — )■ oo, a{n, A) converges to a constant a with < a < 1 if and only if 



for some /3 > 0. // ([2| holds, the 



f^{\ - pn) -)■ P 



1 



a 



(2) 



(3) 
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Theorem IT] implies that when the system is large enough (|3|, which is called the Halfin- Whitt approx- 
imation, approximates the Erlang-C formula well. 

In [lOj . Janssen et al. provide new bounds for the Erlang-C formula which turn out to be more 
analytically tractable than the Halfin- Whitt approximation. In subsequent sections, we use these bounds 
to build approximate staffing models and obtain asymptotically optimal solutions. Hereafter, we refer to 
these bounds as the JVLZ bounds. 



Theorem 2. (Janssen et al. ^U\) Let p = \/n and 

a — y— 2n(l — p + Inp), 

/3=(n-A)/yA, 
7 = (n - A)/^/r^ = P^. 



For n > X, 



and 



a{n, A) < 



P + 7 



Ha) , 2_ 
(f>{a) 3y/n 



a{n, A) > 
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(a) 12n-l , 

Using equation ^ we can express the continuous Erlang-C formula and its bounds in terms of P and 
A. Define 

r5(/3,A) ^a{X + f3VX,X) 

as the continuous Erlang-C formula with respect to /?, where a{-, •) is defined in equation M. Notice 
that the formula above implies that for a particular arrival rate A, that a square-root safety staffing level 
A -I- (3\/X is used. 

Now, let UB{f3, A) represent the upper bound on the Erlang-C, as given on the right-hand side of 
inequality ([7]), and let LB{f3, A) represent the lower bound, as given on the right-hand side of inequality 
([8]). One then might hope that if A grows large and square-root safety staffing is used, then the upper 
and lower bounds converge to the Erlang-C formula. In fact, this is already known to be true via the 
Halfin- Whitt result. What we demonstrate here is that the convergence is actually uniform in the square- 
root staffing factor /3. This uniform convergence is useful for proving asymptotic approximation results 
for the optimization problems we formulate in later sections. 



Before we present the main result we state a series of lemmas needed to establish the primary con- 
vergence result of this section. The proofs of the next five lemmas are given in the Appendix. 

Lemma 3. For positive X, UB(f5, A) is strictly decreasing in A for any fixed /3 > 0. 

Lemma 4. Let n = X + (3yX and let 7 be defined as in ([6]). Then 7/ (12n — 1) converges uniformly to 

0, in /3, as A — > cx). That is 

7 
lim sup — — 0. 

A-foo ^>o 12n — 1 

Lemma 5. Let n = X + /3vA and let p, 7, and a be defined as in TheoremvA Then p(p{a) + 7$(a) is 
strictly increasing in j3 for any sufficiently large X. 



Lemma 6. Let n = A + /?v A and let p, 7, and a be defined as in Theorem\^ Then p(t>{a) + 7$(a) + 

270(a) 1 7 

3^/n ^ (12n-l) 



"''^"'^ -1 ^i— pr is uniformly bounded away from for all sufficiently large A, and all /3 > 0. Specifically, 



inf p0(a) + 7$(a) + ^^^ + .^^ ^. > 0, (9) 

for M sufficiently large. 

Lemma 7. For positive /3, UB{/3, A) is strictly decreasing in /3 for any fixed A > 0. 

Finally, we are ready to state and prove the main convergence result of this section, Theorem |8] This 
result plays a prominent role in proving the asymptotic optimality results in Sections [3] and |4] 

Theorem 8. The upper bound of (l7| and lower bound of ([8]) converge uniformly, over positive f5, to the 
Erlang-C formula as A —> 00. That is 

lim sup [a(/3, A) - LB(/3, A)] == 



and 



lim sup [UB{li, A) - a{l3, A)] = 0. 



Proof. To prove the uniform convergence, we only need to show that the upper bound, UB{/3,X), uni- 
formly converges to the lower bound, LB{/3,X), i.e., 

lim sup [UB{P, A) - LB{/3, A)] = 0. 

A-i.oo^>0 

With n = X + P\/X, we have 

UB{(i,X)-LB{p,X) ^ ^ 
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From Lemma [7J we have that 






P + ^[^ + 37^ 



is strictly decreasing in /3 for any fixed A > 0. Furthermore, it is easy to verify that 

-1 



Thus to show 



it suffices to prove 



and 



P + 7 



Ha) I 2 
0(a) S\/n 



I VA>0. 



/3=0 



hm sup C/B(/3, A) - LB{(3, A) = 0, 

A-i.oo^>0 



hm sup 7-— — = 0, 

A-s-oo ^>o (12rt — 1) 
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A>S>o^^")(,^+7(^ + V^ + 0(a)(12n-f) 

These conditions hold by Lemma |4] and Lemma [6j respectively. 



>0. 



2.2 Model Formulation 

We now introduce the single- and multi-station models of interest in this paper. The manager of these 
systems is concerned with both staffing costs and quality of service. In this paper, we use the probability 
that a customer must wait to receive service to measure the quality of service and model the trade-off 
between the staffing cost and this probability. We first consider a service center modeled by a single 
M/M/n queue and then examine systems with L parallel M/M/n queues, as depicted in Figure [I] 






Figure 1: Multi-station System 



We begin with the single-station system. Because of the manager's competing measures, we face a 
bi-criteria optimization problem, in which we want to simultaneously minimize the staffing cost and the 



probability of inducing customer waiting. Let c{n) be the staffing cost function, and assume c(n) is 
strictly increasing in the staffing level n. Next, for an M/M/n queue with arrival rate A, let W{n, A) be 
a random variable corresponding to the stationary delay (waiting for service). Our bi-criteria model for 
this call center problem is: 

vmin„gz+ [c(n),P{VF(n,A) >0}], (10) 



where "vmin" denotes vector minimization and a solution of model ( 10 ) corresponds to the family of 

staffing levels that falls on the efficient frontier. Of course, P {W{n, A) > 0} is equal to the steady-state (or 

long-run) probability that a customer must wait for service. By PASTA this is equal to P {Q{n, X) > n}. 

In general, we are interested in asymptotic solutions to this bicriteria model. We consider a sequence 



of problems of the form (10) with A — > oo. As A goes to oo, the staffing level n also goes to oo as does 



the staffing cost c{n). Hence, we need to reformulate (10) to obtain a well-posed model. We use the 
square-root staffing discussed earlier in the context of the Halfin-Whitt regime. Next, replace the staffing 
level n in (10) with A + /3\/A and rewrite the model using the decision variable /3: 



vmin^>o [c(/3), P {W{/3, A) > 0}] , (11) 

where c(/3) is the cost function parameterized in /? rather than in ri and W{l3, A) is again the steady-state 
delay. Thus, we reformulate the decision problem as one of choosing the safety-staffing parameter /3, 
rather than the number of servers. We assume c(/3) is continuous and strictly increasing in its argument. 



and hence for any fixed value of A, models (10) and (11) arc equivalent. (Recall, our goal in these bi 



criteria models is to form the efficient frontier of solutions.) Moreover, the optimal value of (3 does not 



grow large as A grows large, and hence the asymptotics associated with model (11) have finite limits 



One way to solve model (11) is to make one component of the objective function a constraint. For 



example, we can solve the bi-critcria problem by solving a family of models: 



min c(/3) s.t. P{M^(;9, A) > 0} < e, (12) 



parameterized in the risk level threshold, e, where < e < 1. 



For each e, by solving model (12), we obtain an optimal /3. The staffing cost and the probability of 
waiting corresponding to the optimal (3 give one point on the efficient frontier of the bi-criteria problem. 
By varying e from to 1, we obtain all the points on the efficient frontier. 



Another way to solve model (11) is to use a weighted objective function approach. That is we solve 



the bi-criteria problem by solving the family of models 



min c{/3) + 5¥{W{(3,X)>0}, (13) 



parameterized by S > 0, the weight on the second term in the objective function. Solving model (13 1, by 
varying S, we obtain all the extreme points of the convex hull of the efficient frontier (cf. [12] )■ 

We use the example given in Figure [2] to explain the relationship between model ( |l2| ) and model (13 1. 
In Figure [2j we assume the points p, g, r, s and t correspond to Pareto efficient solutions to the bi-criteria 



s •, 



\p. 
\ 



risk 



Figure 2: An Efficient Frontier 



problem under consideration. If we use model (12 1 to solve the bi-criteria problem, then by varying e, 



we achieve all solutions on the efficient frontier. So, we achieve all five points, p, q, r, s and t. On the 



other hand, if we use model ( 13 ), unless the efficient frontier is convex, we do not achieve all five of these 



points. However, the solutions which are extreme points of the efficient frontier are achieved. In other 



words, points s, q and t are achieved by solving model ( 13 1 and varying 5 



Model ( 12 1 directly describes what is typically viewed as the practical need. Generally, service center 



managers try to find a staffing level that minimizes the staffing cost while maintaining a certain service 
level. However, in our view there is insight to be gained by forming the efhcient frontier to better 
understand cost-quality tradeoffs. This is particularly true when contractual service levels have not yet 



been determined. In what follows we use either model ( 12 ) or ( 13 1 to present results, depending on which 
is more convenient. 

We now extend our model to a multi-station system as depicted in Figure IT] Suppose we have L 
M /Ad /n queues in parallel. Station i, i = 1, . . . , L, has arrival rate A^ and rii servers, determined by j3i 
via the square-root staffing rule. Then we formulate: 



mm 

/9>0 



L 

E 



c.(A) + 5P' 



U{PF,(/?„A0>0}1, 



(14) 



where, /3 = (/3i, . . . , (3^) and Wi{(3i, Xi) is the stationary delay at station i. The second term in the sum 
above again incorporates costs related to quality of service. We assume that Ci{j3i) is continuous and 
strictly increasing in Pi for i = 1, . . . , L and that the L stations operate independently, i.e., the arrival 
and service processes are mutually independent. Then the above model is equivalent to: 



mm 



L 

E 

i=l 



c,{Pi) + 5 



l-\{{l-V{WdP^.\)>Q})\ 
\ J=l / 



(15) 



3 Deterministic Arrival-rate Problems 

In this section, we formulate approximate versions of the optimization models presented in the previous 
section, using the JVLZ bounds. We prove asymptotic optimality of the approximate solutions for single 
station, and then multi-station models. These results are primarily stepping stones for the asymptotic 
optimality results in Section HI where we analyze the case of random arrival rates. 

3.1 Single-station System 



We begin with the single-station system and consider model (12): 



min c{l3) s.t. P{W^(/3, A) > 0} < e. 

Recall that the probability in the constraint is given by P{VK(/3, A) > 0} = a(/?. A). Next, for any fixed 
A, define model F\ as: 

min c(/3) s.t. a(,9,A) < e. (16) 

As A varies, we obtain a sequence of models {^^a}- The Erlang-C formula can be numerically unwieldy, 
especially when the arrival rate grows large. So we build an approximate model by replacing the Erlang-C 
formula with the JVLZ upper bound, UB{I3, A), defined by the equation on the right-hand side of (l7|, 
except that n is replaced by A -f /3vA. Using UB{j3, A) we define our approximate model G\ as: 

min c(/3) s.t. UB{P,X) < e. (17) 

Notice that any feasible solution of model G\ is also feasible for model F\ . 



Theorem 10 establishes the asymptotic optimality of using solutions of Gx to solve model Fx- The 
theorem is stated in terms of convergence of the decision variables. Since the cost function is assumed to 

we 



be continuous, this also implies convergence of the objective values. Before turning to Theorem 10 
first provide a supporting lemma. 

Lemma 9. Let A > 0, a(n. A) be as defined in (n]), and set a{f3,X) — a{X + /iyX, X). Then a{f3,X) is 
strictly decreasing in (3 for any X, (3 > that satisfy X + /3yX > 1. 

Proof. To prove (5(/3, A) is strictly decreasing in /3, it suffices to show that a{n, A) is strictly decreasing 
in n. Jagers and Van Doorn [S] prove that a(n, A) is convex in n. Also, we know that a(n. A) is strictly 
decreasing in n on the positive integers. This implies a{n,X) is strictly decreasing in n for all n > 1, 
otherwise its epigraph is not convex. D 



Theorem 10. For A > 0, let the optimal solution of Fx, as defined in (16), be /3^ and let the optimal 



solution of Gx, as defined in (17), be f3'^ . Then /3^ > /3^ , VA > 0, and there exists a finite /3* such that 



lim PX = lim K = /3*. 

A— *-oo A— ^oo 



Proof. The objective function, c(/3), is strictly increasing in /3 and by Lemmas m and ^ a(/3, A) and 



UB{/3,X) are strictly decreasing and continuous in (3. Hence, the unique optimal solution of models (16) 



and (17) are defined by requiring the respective constraints to hold with equality. That is, /3^ solves 
a(/3. A) = e and /3^ solves UB{/3,X) = e. We know that UB{I3,X) and a{f3,X) have range (0,1] and we 
have 

UB{0,X)^a{0,X) = l 

and 

lim UB{P,X)= lim 5(/3,A)==0. 

Also UB(j3, X) and a(/3, A) are continuous and strictly decreasing in /3 on [0, cxd). So, the optimal solutions 
(3^ and (3^ exist and are unique for any e > and A > 0. From Lemma pH we have (3^ > f3^ > 0, for any 
A2 > Ai- This indicates lim^-i-oo l^x exists and is finite. Let lim^-i-oo P\ — P* ■ Since UB{f3, A) > a{/3, A) 
for any f3 > and A > 0, we have /3^ > /3|^ for any A > 0. This together with the fact that {(3f} is 
a bounded sequence, indicates that {/3^} is a bounded sequence. So {(3^} has at least one subsequence 
that has a finite limit. For any subsequence {f3y} with a limit and its corresponding limit /3, we have 

lim a{l3, A') — e. 

A'— >-oo 



Also, for any /3, we have 



This indicates that 



lim (a(/3, A) - C/B(/3, A)) = 0. 



lim UB{J3,X')^ lim a{J3,X!) = e. 

A'— foo A'— foo 



Since \iuYy^aaUB{(3* ,X') — e, and there exists a unique (3 satisfying \iuY\i^rx,UB{j3^X') — e, we have 

?;^} have the same limit point, (3* . Thus lim^_^oo P\ 



that (3 = (3* . This implies that all subsequences of {(3^} have the same limit point, (3* . Thus limj^^oo /^f" 



exists and is (3* . In other words. 



lim /?^ = lim /3f = /3* 

A— VOO A— >-CX3 



3.2 Multi-station System 



We now extend our development to a multi-station system and return our attention to model (15): 

mm ^c,(A)+qi-n(l-P{W,(/3„A,)>0})j . 
i=i \ i=i / 

As in the single-station system, we formulate an equivalent model using the continuous Erlang-C formula, 
and we again denote this by model Fy. 



mm ^c,(A) + 5(l-[](l-a(A,A,))) 

4=1 \ i=l / 



(18) 



10 



Following an analogous development to our single-station system, we build an approximate model for 



(18) by using the JVLZ bound UB{f3,X). The approximate model Gx is: 



i-n(i-c/i?(A,A.))]. 

^ i=l ) 



mm X^c,(/30 + Ml-||(l-f^^(A,A.))l. (19) 



Denote the objective function of model F\ as ]\[)., and the optimal solution of F\ as the L-vector 
/3|^. Similarly, denote the objective function of model G\ as g\{-), and the optimal solution of G\ as 



/3^. Theorem 12 implies the asymptotic optimality of solutions to the approximate model as the arrival 
rate vector grows large. We let the arrival rates grow in the following way. Assume there are initial 
values of arrival rates for all queues. Let the initial vector of rates be A° — {X^, . . . , A° )• Indexing the 
sequence of systems under consideration with positive integers, assume the arrival rate for the ttz*'' system 
is A™ = toA*^ . Then as rn — )■ oo the components of A™ grow large together. 
The lemma below, needed for the main result, is proved in the Appendix. 



Lemma 11. Let fmi:) denote the objective function of model F\ as defined in (181, with arrival rate 
A™ = TT^A*^. And, let gm{') denote the objective function of model G\ as defined in (191, with arrival rate 
A™. Then, 

Van sup {gm{P) - fmiP)) = 0. 

m-s-oo ^>Q 

Theorem 12. Let fm{-) denote the objective function and let /3,^ denote the optimal solution of model 



F\ as defined in (18 I, with arrival rate A™. Let gm{) denote the objective function, and let f3^ denote 



the optimal solution of model Gx as defined in (19), with arrival rate A™. Then 



lim (/™(/3,'rJ-/„.(/3,fj)=0. 

m— >C30 

Proof. First, since /3^ is optimal with respect to F\ we have fm{Pm) ~ fm{Pm) — 0- The objective 
function g„ is an upper bound for /„ and so we have gmiPm) > fmiP^)- Thus fm{l3^) ~ fmij3^) < 
9m,{Pm) ~ Imifim)- Wc also havc g-mW^) > gmWm)^ ^incc /3^ is optimal, with respect to G\ with arrival 

we have 



11 



rate A". This imphes that fmW^-,) ~ fmW^) < gmifim) - frn{Pm)- According to Lemma 

9rn{Pfn) " fm{.Pfn) ^- 0, as m -> oo. This proves that fm{P'^) - fm{P^) ^- 0, as m -> oo. Q 

Theorem [12] indicates that for the multi-station problem we describe above, our approximate solution 
is asymptotically optimal for any (5 > 0, in the sense that the absolute gap between the optimal objective 
value and the approximate objective value goes to 0. Notice that Theorem [TOJ for the single-station case, 
is slightly stronger, in that it implies that both the approximate solutions and the approximate objective 
values converge to the respective optima. 

We could also build the approximating problem using the lower JVLZ bound, Li?(/?, A), to replace 
the Erlang-C formula in the original model F\. In this case, we can again obtain a result analogous to 
Theorem |12| We prefer to employ the upper bound rather than the lower bound because the solution 

11 



under the former approximation is appropriately conservative, i.e., it is guaranteed to be feasible for 
model Fx, while the solution under the lower bound is not. Finally, one could also build approximating 
problems using the Halfin-Whitt formula. However, the required uniform convergence result seems harder 
to establish. 

4 Stochastic Arrival-rate Problems 

In this section, we extend models from Section [3] to include arrival-rate uncertainty. That is, we focus 
on solving large-scale staffing problems when the arrival rates are uncertain in addition to the inherent 
randomness of the system's inter-arrival times and service times. In particular, we consider a decision 
making scheme in which the manager must select staffing levels before observing the arrival rates. How- 
ever, the decision maker does have complete distributional knowledge of the arrival rates. Such a model 
reflects typical practical situations in which the staffing schedule must be determined in advance, with 
only a forecast of possible daily call volumes in hand. The manager's goal is now to minimize staffing 
costs while meeting QoS levels averaged over time. We represent this by taking an expected value over 
the possible arrival rates. 

In the setting where the state space of possible arrival rates is discrete, we show that as the system size 
grows, there is at most one key scenario under which the probability of waiting converges to a non-trivial 
value, i.e., a value strictly between and 1. In any other scenario, the probability of waiting converges 
to either or 1, that is the staffing level is either over- or under- loaded in any scenario other than the 
key scenario. Exploiting this result, we propose a two-step solution procedure for the staffing problem 
with random arrival rates. In the first step, we use the desired QoS level to identify the key scenario 
corresponding to the optimal staffing level. After finding the key scenario, the random arrival-rate model 
reduces to a deterministic arrival-rate model. In the second step, we solve the resulting model, with a 
deterministic arrival rate, by using our approximation model proposed in Section [3] The approximate 
optimal staffing level obtained in this procedure converges to the true optimal staffing level for the random 
arrival-rate problem as the system's size grows large. 

4.1 Single-station System 

As before, we first analyze the single-station system and then turn to the case of L parallel M/M/n 
queues. So, in the single-station system, let A denote the random arrival rate. Let A" be a specific 
realization, where uj is an outcome, or scenario, from the sample space fl. We assume that fl is finite. 



Let p" be the probability assigned to scenario w. A naive attempt to extend ( 12 1 to the doubly stochastic 
setting results in the following model: 



c(/3) s.t. ^p'^P{VK(/3,A)>0| A = A'^}<e, (20) 

where P{W{(3,A) > | A = A'^} = (5(/3, A"). 



mm 

/3>0 
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However, there is a fundamental shortcoming in this formulation. In our desired stochastic program 
the staffing decision made at time should be nonanticipative; i.e., it cannot depend on a realization of 
the randomness not yet observed. However, in this formulation, the number of servers does depend on lo 
in that the number of servers varies by ui via A"^ + /3vA". In order to rectify this, the decision at time 
must consist of both choosing the square-root staffing factor /3 and a specific scenario 10^^^. Therefore, 



the number of servers chosen is given by A" " + /3v A"'"^'' which is not dependent on the outcome w. It 
turns out that such a scheme still enables us to produce asymptotically optimal solutions to the staffing 
problem. 



So, we revise the extension of model (12) as follows and denote the model Ft^: 



min c{p,,J"'y) s.t. y p"P{I^(^,A) >0| A = A"}<e. (21) 

/3>0.cj''=!'Gf2 ■'He 

wen 
To facilitate asymptotic analysis, we need to properly define how A grows large. Let the initial value 
of the arrival rate in all scenarios be Aq = (Aq\ . . . , Aq'*^'), where, without loss of generality, we assume 
the components of Aq satisfy Ag^ < Aq^ < • • • < Aq'*^'. Next, assume that the arrival rate in the to*''. 



TO- e N, system is A.,„ ~ /tiAq. We then let to — ?► 00. Define now (21 1 with A^ as Fa^: 



mm 



c(/3,a;'='=") s.t. V p"P {W(/3, A™) > | A^ = A^} < e. (22) 

The objective function c(/3,a;) is assumed to have the following property: c(-,aj) is strictly increasing and 
continuous for all w € fi. 

As implied by Theorem [T] when A is deterministic the probability of waiting has a non-degenerate 
limit if and only if the number of servers, n, increases in such a way that n = A + fi\f\ for some /3 > 0. 
In model (22), the number of servers n, or equivalently (/3, w*^''*'), is chosen before we see the realization 
of the arrival rate. For a given staffing level (/3,cj'^'^^), scenario uj^^^ is the only scenario for which the 
limiting probability of waiting is strictly between and 1. In other scenarios, the system is either over- 
or under-loaded for the chosen staffing level as the arrival rate grows large. We thus obtain the following 
corollary of Theorem [T] 

Corollary 13. For a given stajfing level specified by f3 > and ijj^'^y , we have 

hm a{KC' +pJh!^\KC') G (0,1). 

m— >-oo V 

For all to ^ Q such that cu ^ cu^^'^ and Aq > Aq , we have 

for all LJ ^ il such that uj ^ uj^^-y and Aq < Aq , we have 

lim a{K^T' +pJa!^\K:^)^0. 

m— >oo V 



Consider the constraint of model (221, for a specific decision ^ > and uji = uj^^^ . Then, we have 



lim aiKrl + /3VA^.% A-0 e (0, 1). (23) 
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Suppose we approximate the QoS constraint in ( 22 1 by 



|n| 

J2 P'^" + P"'P {W{P, A„0 > I A„, =Kl}< e. (24) 

k=i+l 

This approximation replaces P {W{P, A^) > | A^ — K^} by miity for uj ~ i^i+i, • • • , <^|n|i and by zero 



for uj — ujit . . . , Wi-i- In view of Corollary 1 1 3| and equation (23), this approximation becomes increasingly 
precise as m grows large. 



Equation ( 24 ) and the structure of c(/3, uji) suggest that for sufficiently large m we should select the key 
scenario by finding the scenario uJi such that X^LJi P'^'' — ^ ^^'^ J2k=i+i P'^'' "^ e, for i G {1, 2, . . . , |il| — 1}; 
if p^if^i > e, then we select W|n| as the key scenario. In our work, we do not consider the trivial situations 
where e = or e := I. Thus, this mechanism for selecting the key scenario yields a unique w,. Given the 



key scenario oji, we form a first approximation to model (22) as 



min c(/3,a;,) s.t. p'^^V {W{p, A";^) > 0} < e - 2^ p""- \ ■ (25) 

' k=i+l 




The term P {W [13, AJ"^') > 0} in model 1^ is calculated by the Erlang-C formula, a(A^„'+/3VA™' , AJ^^'). 
We can use the upper bound UB{/3, Af^) to approximate P {W(J3, Af^) > 0} and build our approximating 
model with uji which we denote G\^ : 

/ M \ 

mm c(/3,c^,) s.t. p"'C/i? (/3, A^*) < e- J] p"M • (26) 

\ k=i+l J 

Our next goal is to extend Theorem [lO] to the doubly stochastic case considered in this section. First 



we need Lemma 15 which is proved in the Appendix. That lemma relies on a classic result from real 



analysis, stated below for completeness. 

Theorem 14. (Buchanan and Hildebrandt [5 ) If a sequence ,fn{x) of monotonic Junctions converges to 
a continuous Junction f{x) in [a, b] then this convergence is unijorm. 

In the lemma below, we extend to the models in Section [3] to include constraints whose right-hand 
sides are also functions of A. 

Lemma 15. Let A > 0. We extend model (16) in Section[Mto 

min c(/3) s.t 5(/3, A) < eA, (27) 

and denote its optimal solution by f3^ . We also extend model (17) in Section\3\to 

min c(/3) s.t. UB{/3,X) <ex, (28) 

and denote its optimal solution by /3^ . Here the right-hand side e\ satisjies limA-s-oo £\ — e > 0. Then 
f3^ > j3^ , VA > 0, and there exists a finite /3* such that 

lim /3^= lim /3f = /3*. 

A— )-oo A— >-oo 
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In the following result, we use Lemma [15] to infer that the gap between the optimal solution for model 



(22 1 and the optimal solution for model (26) goes to as the system size increases, i.e., the approximating 



solutions are also asymptotically optimal in the doubly stochastic model. 

First, we introduce some notation. Let (o;^,/?^) be an optimal solution to model -Fa„ as defined in 



(22 1. For any m, there are multiple such solution pairs, since the same staffing level can be achieved by 



different combinations of ujm and /3m- However, once ujm is chosen, there exists but one optimal /3m- For 
a fixed m, let W^ be the set of all optimal tOm- 



Theorem 16. Let /3^ be an optimal solution to model Ga^ as defined in (26). Assume e G (0, 1) is such 
that there exists an i with X^UiP"*" -^ ^ '^'^'^ X^LJi+i?'"'" ^ ^- Then, there exists an ffi such that for all 
m > fh we have uji € W^. And, there exists a /3* > such that 



hm pg^ hm /3,f, = /3*, 



where the (3^ are the optimal staffing factors for (22) with uJm = i^i for all m. 



Proof. In what follows we use lu < oj' to mean Aq < Aq . Since \fl\ is finite, from Corollary 
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we have 



lim max min <^ a(AJ:; + /3W A^;, cj), 1 - a(A^ + /3./ A^', cj) }■ = 0, 

for each fixed /3 > 0. 

Thus, given A > and a fixed /3 > 0, there exists an to, such that for all m > to 



max <; a(A„r + /3 V ^"" , w) J^ < A, 



and 



max <; 1 - a(Ar„'" + /3y Am"\uj) } < A. 



Hence, for all m > m, the left-hand side of the constramt in (22) is bounded above by 



E p 









and bounded below by 




(1 - A) +p'""a(A^'" +/3VAm"%a;,;;) 



(29) 



(30) 



Then for A chosen sufficiently small, we have the following conclusions. For any m > m, if w^ < (^i, (30) 
indicates a contradiction of feasibility for Fa^. If w^ > (^i, (29) indicates that model (22) is feasible for 



all TO > m. However, model (22) is also feasible in the uji case for all m> m. This indicates that there 



exists an m, such that for all ?7t, > to we have uji G W^. Then, for all m > m, with Wi fixed, we can 



re- write (22) as a deterministic model as follows: 



|S2| 



mm c{p,LOi) s.t. p'^^¥{W{/3,A';::) > 0} < e - ^ p"-" + A„ 



(31) 
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Here liram^oo ^m = 0. Applying Lemma [15] we have 

lim /3G^ lim /3,f, = /3*. 



Due to the continuity assumption on the cost function, the resuh above also implies that the objective 
function values converge. 

4.2 Multi-station Systems 

We now consider a multi-station system, again assuming that there are L queues, whose dynamics are 
conditionally independent. We define A = (Ai,...,Al) to be the random arrival rate vector. Let 
A" be a specific realization, where cj — (cji, . . . ,0;^) is a sample point from the finite sample space 
H. = rii X ■ ■ ■ X 57^. Let p'^ be the probability assigned to scenario ui. We consider the model below: 

L ( L ~1 

min Vc,(A,wf') s.t. Vp"P<^ U{W^,(ft,A,)>0} 

which is equivalent to the following model: 

L 



A - A" ^ < e, (32) 



min Vc,(A,c.f^) 

/3>o,ui>"yen ^-^ 



i=l 
L 



(33) 



s.t. ^ P" n ^ {^^(^*' ^^) = 0| A, = Af } > 1 - e. 



Lj^n i—i 
where 



P{W^,(A, A,) = 0|A, == Af } = 1 - a(A^"" +/3,VA^"", Af). 

Again, the cost function Ci(-, •) is assumed to have the following property: Ci{-,uj) is strictly increasing 
and continuous for all w G 57 for i = 1, . . . , L. 



Facing this random arrival-rate model, we may think that instead of solving the joint model (33), it 
would be easier to solve several single-station models. That is, we can treat the L queues individually, 
and obtain the optimal staffing policy for each queue separately. For example, instead of solving model 



(33), we consider solving the following set of individual models, indexed by i: 

min Ci(/3i,i:j/''^) 

s.t. Y. PTP{W^{f3^,^^) = | A, = A^' } (34) 



iOiGili 



> VT^, i = 1,2,...,L, 
where p"' is the marginal probability of scenario w.^ e Qi. 



When decomposing the joint model (33) into the models in (34), it is difficult to decide on the right 



hand side of the constraint in each individual problem. In ( 34 ) , we set the right-hand side of each to 
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be \J\ — e with the notion that the service level should be the same across the individual stations if 
there is no managerial reason to favor one station over another. Although decomposing the problem in 
this manner improves tractability, the resulting solution may be undesirable, as shown in the following 
example. 

Example 1. Let L — 2 and consider the M/M/n system as shown in Figure\^ Suppose each queue has 
random, arrival rate Ki, i — 1,2. Assume there are two scenarios for the arrival rate of queue 1 (high 
and low) and there are three scenarios for the arrival rate of queue 2 (high, medium and low). The joint 
probability distribution is given in Table [71 The realizations of A for each queue under each scenario are 
given in Table \^ We assume a linear cost with cost coefficients ci — $5, ci = $3 and a service level 
threshold value of e — 0.05. 



Ai 



$ 5/server ( n^ 





n2 1 $ 3/server 



Figure 3: Example [T] System 



Table 1: Joint Probability for Example fT] 


p{^l,'^2} 


UJ2 — high 


UJ2 — medium 


UJ2 — low 


oJi = high 


0.03 


0.21 


0.1 


LOl = low 


0.01 


0.17 


0.48 



Table 2: Arrival Rates for Example [l| 



high 



Ai for queue 1 450 
A2 for queue 2 300 



medium 



low 



NA 



350 

loo" 



For the given parameters we solve the exact (non-asymptotic) versions of the corresponding models 
as given in ( 33 1 and ( 34 1 . From the solutions in Table [s] we can see that while both achieve the same 
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service level, the cost of the staffing policy from the decoupled models (34) is about 5% more than that 



of the joint model (33 1. 





Table 3- 


Solution Comparison 






exact model (33) 


decoupled models (34) 


n 


{nl = 496, n*2 = 235) 


(ni =484,rT2 =306) 


cost (ciTli + 0211,2) 


3185 


3338 


Ea 


¥l\jf^^waiU{n,,A,) > o| 




0.05 


0.05 



We now discuss how to formulate an asymptotics for joint model (33). Again, we consider a sequence 



of queueing systems with increasing arrival rates. The asymptotics of the arrival rates are similar to the 
scheme we considered in the single-station system. We assume there is a base value of the arrival rate 
in all scenarios. Let this base rate be A° = (A° \ . . . , A'^ '"'), where each A" '' ,k = 1, . . . , |r2|, is an 
L- vector, since it represents the base arrival rate the for i-station system in scenario ujk- For ttt, e N let 
the arrival rate in the tti* system be A^ — toAq and again let ?tj — >■ 00. 

As in the single-station system, the manager must pick the square-root safety factor /3i,i = 1, . . . , L 
for each queue, before the realization of Ai,i = 1, . . . ,L. Thus once the staffing factors are fixed, then 
in at most one scenario, u ^ — {uti^^ , . . . , w^'^*'), the probability of waiting in each queue converges to 
a value strictly between and 1 for large arrival rates. That is we can find a key scenario o;'^'^^, such 
that Vw 7^ oj'^'^y and Vfc G {1, . . . , L} if A^*" > ^fe'' i the limiting probability of not waiting for service in 
queue k under this scenario is 0; and, if A^'' < A^,*" , the limiting probability of not waiting for service in 
queue k under this scenario is 1. Thus, if a key scenario can be identified, the random parameter model 
reduces to a deterministic model. However, unlike the single-station system, it is not easy to identify a 
key scenario. Just as a multivariate distribution does not have a unique quantile, in this problem, the 
key scenario need not be unique, since the QoS "risk" can be spread in a number of ways. If one knows 
in advance how to allocate the probability embodied in e to each station, then it is relatively easy to 
identify the key scenario and for any such allocation, the existence of a key scenario for each station is 
guaranteed using the previous single-station arguments. In theory, an integer programming model can 
be created to find the allocation of e which minimizes the staffing costs. 

We are now ready to present the stochastic arrival rate versions of the results in Section |3.2[ As in 



the deterministic rate case, we ffi'st formulate the "dualized" version of (33), a model which we denote 
by Fa: 



mm 



L 

E 



key 



c.(ft,u;r') + 5 1 



(jj£fl 



p'^nP{H^,(A,A,0 = O|A, = Ar} 



j=i 



As before, the corresponding approximate model is denoted G\ 



mm 

j3>o,Lo''^yen 



i-Ep"jn(i-f^^(/3-Ar))|) 



(35) 



(36) 
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With these formulations, we now have the following extension of Lemma [TT] 



Lemma 17. Let fm{-, •) denote the objective function of model Fx as defined in (35), with arrival rate 
A™. And, let g„i{-,-) denote the objective function of model Gx as defined in (36), with arrival rate A™. 
Then, 



lim sup (5™(/3,c^'=^y)-/„(/3,u;^^^))=0. 



Since we assume that the sample space fl is finite, the lemma follows from minor modifications to 
the Lemma |ll| proof. We now present our final asymptotic optimality result, which relates the models 



presented in ( 35 ) and (136 



Theorem 18. Let fm{', •) denote the objective function and let {f3^,uj^) denote an optimal solution of 
model Fa as defined in (35), with arrival rate A™. Let (7m(-,-) denote the objective function, and let 



{f3^,UJ^) denote an optimal solution of model Ga as defined in (36 I, with arrival rate A'". Then 

lim (fmiPZ^nr) " /m(/3™,0) = 0. 
m—^oo 

The theorem follows by completely analogous arguments used to prove Theorem |12[ applying Lemma 
[TtI instead of Lemma [Tl] 

Finally, we use the data in Example [l] to illustrate how the key scenario idea can be used to solve a 
multi-station problem. It is obvious that the key scenario in this example is (^1,^2) = (high, medium). 
Otherwise the value of the left-hand side of the constraint in model (33) cannot exceed 1 — e. Also, 



with the key scenario being (high, medium), we can select {Pi, 1^2) to satisfy the constraint. Thus it is 
not necessary to consider scenario {high, high), which is more costly. After finding the key scenario for 
Example [Tl we can write model (33) for the asymptotic version of the original random rates problem as: 



min 5/3i -I- 3/32 

s.t. 0.21 (1 - 5(/3i, 450)) (1 - a(/32, 200)) 
+ 0.1(1 - 5(^1, 450)) + 0.17(1 - 5(^2, 200)) 
-I- 0.48 > (1-0.05). 



(37) 



Solving model (37l we obtain the optimal solution {Pi, 132) = (2.15,2.48), which gives 
(n*,n;) = ('450 + 2.15-%/450, 200 + 2.48 • V20o') « (496,235). 



We now solve the problem using the individual models as presented in ( |34[ ). The individual models 
for station 1 and station 2 are: 



min 5/3i 

/3i>0 



S.t. 0.34(1 - 5(^1, 450)) + 0.66(1 - a(/3i, 350)) > Vl - 0.05, 



(38) 



and 



mm 

/32>0 



3,92 



s.t. 0.04(1 - 5(;92, 300)) + 0.38(1 - a(;92, 200)) 



(39) 



+ 0.58(1 - a(/32, 150)) > Vl - 0.05. 
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The key scenarios for queue 1 and queue 2 are both high. The individual models above are equivalent 
to: 



and 



min 5/3i s.t. 0.34(1 - (5(/3i,450)) + 0.66 > Vl - 0.05, (40) 

/3i>0 



min 3/32 s.t. 0.04(1 - 5(/32, 300)) + 0.38 + 0.58 > Vl - 0.05. (41) 



The optimal to solutions to (|40|) and (|41j) are ^^ = 1.6, /S^ = 0.36. This gives ni = 450 + 1.6- V450 w 484, 
n2 ~ 300 + 0.36 • \/300 w 306. As noted in Table [sj the cost of the solution obtained via decoupling the 
stations is about 5% higher than the cost obtained using the joint model. 
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Appendices 

Proof of Lemma^ To show UB{j3,\) is strictly decreasing in A, it sufhccs to show the denominator of 
01 /O + 7 ( -Jr^ + wj=^ J , is strictly increasing in A for any /3 > 0. First note that 

2 \ 3A + 2/3VA 

P + 7 ' 



3xM 



is strictly increasing in A for any /3 > 0, since 



d 



3A+2ffyA 
3(A+/3VA) 



d\ 6(A + (i^/X) 

7'I>(a 



3 (a + I3y/X 



^^ >0 V/3,A>0. 



Thus, we only need to prove that \i V is non-decreasing in A for any /3, A > 0. Let 



We can show that /(/3, A) is strictly increasing by verifying that ^ > for any /3, A > 0. We have 



df ^ fi^ ^{a) ^ J A [0(a) + a>(a)a] ff 

^^ 4(A + /3VA)v/a7^'^(") VA + /3^/A Ha) 



From (42 1, it is obvious that It > for any /3, A > 0, if |f > for any /3, A > 0. We have 



dX 



y-2(/3yA + A)(l-^,^+ln(^^)) 



(43) 



The denominator of (43) is strictly positive, so it suffices to show 



We have 



,(^,A).-A + ('i+ Ml„A + _|,,0, VftA>0^ 



lim g{P, A) = 0, V/3 > 0, 

A— VCK) 
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and 



Since 



we have 



Thus we have ff < and 



dg ^ (i 
d\ 2\^/X 






< ln(l + a::),Va; > 0, 



-^^-ln(l + ^ ) <0, V/3,A>0. 
+ Va V vA. 



hm g (/3, A) =0, V/3 > 0. 

A— >-oo 



This proves g{f3, A) > 0, V;3, A > 0. 



Proof of Lemma CI With 



5(/3,A) = 



I3J\/(^X + I3VX) 



(12^-1) 12 (a + /3\/a) - 1 ' 

we have that g{(3, A) > and is continuous in (/3, A) for all /3 > 0, A > 10. (Below, we let A grow large. 
The value 10 here simply serves as a sufficiently large lower bound we use in establishing the desired 
result.) We have 



dgW, A) 
dp 



12/3^ 



x+pVx 



\+pV\ 



-l + 12(A + /3VAJj -1 + 12(^A + /3VA 
P>?l^ 



A+^v^ 



Evaluating 



dg{P.\) 

dp 



yields, after some algebra, 

y^ (-12/3271 + /?(-l + 12A) + 2VA(-1 + 12A)) 
-l + 12/3\/A + 12A 
or equivalently, 

-12/32yA + /3(-l + 12A) + 2V'A(-1 + 12A) = 0. 

Then we have 



= 0, 



/3 = 



-1 + 12A + Vl - 120A + 1296A2 
24\/A 



as the only positive root of this equation. Also we have ^(O, A) = 0, VA > 10, 



lim 5(/3,A) =0, VA > 10, 

/3— >oo 
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and g{/3, A) > 0, VA > 10. Thus /3 is the global maximizer of g {f3, A) for any A > 10. That is, 



Since 



we have 



5(/3,A)=maxg(/3,A), VA > 10. 



lim 5(/3,A)=0, 

A— fCXD 



lim supg(/3, A) = 0. 



This completes the proof. 



Proof of Lemma^^ To show that p4>{a) + 7$ (a) is strictly increasing in /3 for any sufficiently large A, it 
suffices to demonstrate that p4>{a) + 7$ (a) is strictly increasing in n for any sufficiently large A, since /3 
and n satisfy a linear relationship with a positive slope. Let h{n, A) — p(j){a) + 7<l'(a). We have 

dh{n,X) ^ -Xcj,{a) ^ ^..!^+^ ^ A(^(a)ln(A) _ (n - A)<^(a) In (A) 
dn V? 2ny/n n a-^/n 

First note that 

-X(j){a) , 71 + A -A0(a) + ?i<i>(a) 

n^ 2ny/n w' 

since n > A > 1. (Here, we take 1 as a lower bound on A since we establish a result for A that is sufficiently 

large.) Also since 

~X(j}{a) + n<P{a) -A0(O) + 7i$(0) 

9 — 9 ^ ^1 

we have 

—X(j){a) :^, ,n^- X 

It remains then to show that 

A0(a)ln(^) (n-A)0(a)ln(^) 

J^^W VnZ _ V m^ VnZ > Q V;3 > 0, A > 0. 

Some algebra demonstrates that this is equivalent to Xa < y'n{n — A) . As shown by equation (50) in [10] , 

a^/J-i/^^-L+Od/A). 
So we have < a < /? for sufficiently large A. Thus 

Aa < A/3 = \^{n - A) < ^/n{n - A), 
since A < n. D 

Proof of Lemma [^ All four terms in the formula on the left-hand side of inequality Q are non-negative 
for all /3 > 0, A > 1. Thus it suffices to show that h{f3, A) — p(j){a) + 7<i>(a) is uniformly bounded away 
from for all sufficiently large A and /3 > 0, since (/){■) is positive and bounded. From Lemmalsj we know 
h{f3, A) is strictly increasing in /3 for all sufficiently large A and h{0, A) — (j>{0) > 0. Thus we have that 
h{f3, A) is uniformly bounded away from for all sufficiently large A and all (3 > 0. D 



23 



Proof of Lemma^ Let UBn{n,X) = UB{^,\). Since n = X + /3y/X, to show UB{(3,X) is strictly 
decreasing in /3 for any A > 0, it is enough to verify that C/i?„(n, A) is strictly decreasing in n, or 
equivalently that 

[f/S„(n,A)] 



, 1 A n- X /$(a) 2 



is strictly increasing in n for any A, where a is given in (HI). Now, 

a[C/i3„(n,A)]-i _ -A /A + n\ /»t>(a)\ /n-A\ f ct>(a) + 'S>ia)a \ / da\ 
dn 3n2 VSnv^yV^Wy \ V^ J \ 0(a) 7 Vi9"/ ' 

Here, 

aa^-]nA^ 
dn a 

Since n > A, we have ^ > and so the third term in (44) is non-negative. Since $(a)/(/)(a) is strictly 



increasing in a and a is strictly increasing in n, we have that ^{a)/(j){a) is strictly increasing in n. Thus 

the first two terms are greater than 

-A A + n$(0) 
3n2 ^ 2n^0(Oy' 

which itself is strictly positive, completing the proof. □ 



Proof of Lemma 11 We prove that 

L L 

n (1 - c/i?(ft, AD) - n (1 - " (/3- ^D) 

1=1 i=l 

converges uniformly to in /3 as to — > cxo, via induction. Let UB (0i, A™) denote 1 — UB{Pi, A™) and let 
a(A, Af") denote 1 - a (/3„ A™). We first prove 

UBWi,XT)UB{(32, A^) - a(/3i, Ar)5(/?2, A^) 

converges uniformly to in /3 = (/3i,/32) as ?7i — >■ oo. From Theorem^ we have that this result holds 
separately for UB{l3i,X'P) - 5(/3i,A5") and C/S(/32,A^) - 5(/32, A^"), which implies that it again holds 
separately for ILB(/3i, A^) - a(/3i, A^") and CLB(/32, A^) - a(/32, A^). Also, note that 

UB{l3i,XT)UBil32,X?) - a(/3i,Ar)a(/32,Ar) 

= UB{Pi, K)UMh. K') - UB{(3i,XTM(32, X?) 

+UB{f3i,XTM/32,X^) ~ a(/3i, Ar)«(/32, A2™) 
= UBWiAT)iUB{/32,X2)-^i(32,X7)) 

+ {UBWi,XT) - «(/?!, AD) a(/32, A^). 
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Thus, 

lim sup (CLB(/?i, Ar)CLB(/32, A^) - a(/3i, Ar)5(/32, A™)) 
= lim sup { cm (/3i , Ar ) (cm (/32 , A^ ) - 5 (/32 , A^ ) ) 

)n->oo^>g 

+ (CLB(/3i, AD - a(/3i, AD) «(/32, A^)} 
< lim sup { Cm(/3i , Ar ) (Cm(/32 , A^" ) - a (/32 , A^" ) ) } 

+ lim sup{(Cm(/3i,Ar)-a(A,Ar))a(/32,A^)}. 

The above inequality holds because both terms are positive. Recall that both the Erlang-C formula and 
the JVLZ upper bound have range (0, 1]. This immediately implies that 

lim sup {UB{(3i,K) {UB{fi2, K) - a(/?2, A^))} - 



and 



Thus, we have 



lim sup {(CLB(/9i, AD - a(/3i, AD) a(/32, A^)} = 0. 



lim sup (CLB(/3i, ADCm(/32, A^) - a(/3i, ADa(/32, A^)) = 0. 

m^oo^>g 



Next, assume that ^K e {2, 3, . . . , L — 1}, 

K K 



n (1 - c/s(/3., AD) - n (1 - " (/5- ^")) 

1=1 i=l 

converges uniformly to in /3 as m — > oo. We now prove that for K + \, we have 

K+l K+1 

n (1 - uB{i3,, AD) - n (1 - " (z^- ^D) 

s=l i=l 

converges uniformly to in /3 as vn — > oo. 
As above, we have 

lf[ (1 - C/B(ft, AD) j Cm(/3K+i, AS+i) - [n (1 - " (/5^' ^D) ) a(/3K+i, A^+i) 
= ff[(l-C/B(ft,Ar))j Cm(/3K+i,A';j+i)- fn(l-C/B(ft,Ar))ja(/3K+i,A^+i) 

+ fn(l-C/B(A,AD)U(/3K+i,A?J+i)- fn(l-<i(A,AD)U(/3K+i,A'i3+i) 
= [f[a-UBi/3,,XT)yj (Cm(/3K+i,AS+i)-a(/3K+i,A™+i)) 

+ ( n (1 - C/S(/3., AD) - n (1 - " (A' ^D) J a(/3K+i, A^+i), 

\i=l 1=1 / 
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Thus, 



'K+l K+1 



lim sup <^ n (1 - ^^(/3„ AD) - n (1 - " (/3- ^D) 

< Jim^supnn(l-C/i?(/3„Ar))j (fZS(/3A'+i,A^+i)-^(/3K+i,A?+i))l 

+ lim sup|(fr(l-;7i?(/3„Ar))-n(l-a(A,Ar))U(fe+i,A^+i)l- 

Again we have that (ni=i (1 ^ UB{f3i,X^))] and a(/3/i-+i, A^_^j) are contained in (0,1]. Then by 
Theorem [8] and the induction assumption, we have 

Jim^suJ f f[(l - C/S(A,Ar)) j (Cm(/?K+i,A^+i) -a(/3K+i,A^+i)) I = 

and 

Km sup U n (1 - f/S(ft, AD) - n (1 - 5 (/3„ AD) ) ai^K+iAK+i) \ = 0. 
This imphes that 

K+l A'+l 

n (1 - c/i?(A, AD) - n (1 - " (/^- ^D) 

i=l 4=1 

converges uniformly to in /3 as m — ^ oo, establishing the result. D 



Proof of Lemma 15 The inequality /3^ > /3|^ for all A > is immediate from the definitions of these 



quantities and the fact that q;(/3, A) < UB{/3, A) for all A, /3 > 0. 

Denote the Halfin-Whitt approximation defined in ([3]) in Section^ as anwi'), and its inverse function 
as ctJJ^r{■). The function anwi') ^^ strictly decreasing and this implies that ct^^r{■) is strictly decreasing. 
For any A > 0, we denote the inverse of UB{-, A) as UB^^{-). UB{-,\) is strictly decreasing for any A > 0, 
and this implies that UB^^{-) is strictly decreasing for any A > 0. By Janssen et al. [TU], we have 

lim UB{I3, A) = anwiP), V/3 > 0. 

A— ^oo 

Together with the monotonicity of UB{/3, A) in A for any /3 > 0, we have 

lim UB-^{x) = a^\y{x), Va; > 0. 

A— >oo 

Since limA^oo ^a = e > 0, there exist ^, u > 0, such that < ^ < eA < u when A is large enough. Also, 
since a^yi/(-) is a continuous function, by Theorem 



14 



we have 



lim sup \UB^'^{x) - aij\y{x)\ = 0. 

A->-oo 2:>0 

This gives 

hm UB^\e^)^a^\^ie), 

A— >-oo 

which implies that liniA-i-oo P\ exists and we denote it by /3* . Then analogous to the proof of Theorem 
10 in Section k51 we can show that the limit of f3^ exists, and liniA-i-oo f^x = limA-s-oo f^l 



3f 
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